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Abstract — Stochastic contraction analysis is a recently developed tool 
for studying the global stability properties of nonlinear stochastic sys- 
tems, based on a differential analysis of convergence in an appropriate 
metric. To date, stochastic contraction results and sharp associated 
performance bounds have been established only in the specialized 
context of state-independent metrics, which restricts their applicability. 
This paper extends stochastic contraction analysis to the case of general 
time- and state-dependent Riemannian metrics, in both discrete-time 
and continuous-time settings, thus extending its applicabiUty to a 
significantly wider range of nonlinear stochastic dynamics. 

I. Introduction 

Contraction theory provides a body of analytical tools to study the 
stability and convergence of nonlinear dynamical systems [8]. Based 
On a differential analysis of convergence, it allows global stability 
properties of a nonlinear system to be concluded from the system's 
linearization at all points in some appropriate metric. Historically, 
basic convergence results on contracting systems can be traced 
Jjack to the numerical analysis literature [7], [4], [3]. Recently, 
contraction theory has been extended to stochastic differential 
systems [9]. This development has led to a number of practically 
important applications, such as the design of observers for nonlinear 
Stochastic systems [2], or the study of synchronization in networks 
of noisy oscillators [11]. 

' The stochastic contraction theorems have been formulated so 
far in the specialized context of state-independent metrics [9]. 
,Yet, more general state-dependent Riemannian metrics can be 
central to some systems, and in fact the original deterministic 
contraction theorems were derived in this general context [8]. Some 
practical nonlinear dynamics can be most easily studied by choosing 
appropriate state-dependent metrics (cf. e.g. [1], [2]), and from a 
fheoretical perspective, the contraction properties of some systems 
can only be observed in a state-dependent Riemannian metric [10]. 

Recently, an attempt has been made to extend the stochastic 
contraction results of [9] to state-dependent metrics [2]. However, 
since in the estimation of the distance between two trajectories the 
derivation did not consider geodesies between these trajectories but 
instead used straight lines, the bounds obtained are not "optimal" 
(in a sense made precise in Remark 3.3). Here, we prove the 
ptochastic contraction theorems in the case of general time- and 
state-dependent Riemannian metrics by studying the evolution of 
the geodesies under the combined effects of the noise and the 
contracting flow, which allows "optimal" bounds to be obtained. 

In section [III we study the contraction properties of discrete-time 
stochastic difference systems. Then, in section [nil we address the 
case of continuous-time Ito stochastic differential systems by using 
a discrete/continuous limiting argument. Finally, section ITVl offers 
brief concluding remarks. 

II. Discrete stochastic contraction 

We first state and prove a proposition (see also [1]), which makes 
explicit the original deterministic discrete contraction theorem (see 
section 5 of [8]). 



Proposition 1 (and definition): Consider two uniformly positive 
definite metrics Mi = &J@i (i = 1,2) defined over R" and a 
smooth function f ; R" — )■ R". The generalized Jacobian of f in 
the metrics (Mi, M2) is defined by 

da 

Assume now that f is contracting in the metrics (Mi, M2) with 
rate /i, i.e. 

Va G R" An,ax(F(a)^F(a)) < p,, 

where Amax(A) denotes the largest eigenvalue of a given matrix 
A. Then for all a, b G R", one has 

rfM2 (f (a): f (b)) < M^Mi (a, b), 

where dM denotes the distance associated with the metric M. 

Proof: Since Mi is uniformly positive definite, there exists 
a C^-continuous curve (a geodesic) F : [0, 1] — >■ R" such that 
r(0) = a and F(l) = b and 



o^Mi (a, b) = 



du 



[u] 



M.^{T{u))(^(u)]du. 



Next, since f is a smooth function, f(r) is also a C^-continuous 
curve. By the definition of the distance, one then has 



dL,(f(a),f(b))< 



V du 



(u)\ M2(f(F(u))) 



mx), 

du 



(m) J du. 



Remark on the other hand that, by the chain rule, 



di(V) 
du 



(u) 



afar 

aa du 



W, 



which leads to 



i: mi- 



M.{^iu))du 

</>(f"070lf)dK 

= pdli^ (a, b) D 

We now state and prove a proposition which relates metrics and 
noise. 

Proposition 2: Consider a uniformly positive definite metric M 
defined over R". Let cr be a matrix- valued function R" -^ R""*, 
771 , ri2 two independent d-dimensional Gaussian random variables 
with rji ~ ./f (0, 1), and a, b G R". Assume that 

Va G R" tr(cr(a)^M(a)cr(a)) < D, 

then one has E [dM(a + cr(a)r7i,b + (j(b)?72)] < dM(a, b)-f2D. 

Proof: As previously, since M is uniformly positive definite, 
there exists a C^ -continuous curve F : [0, 1] — >■ R" such that 



r(0) = a and r(l) = b and 

Consider the curve F^ : [0, 1] — !> R" defined by 

Vu G [0, 1] r^(u) = r(u) + (1 - M)f7(a)77i + ua{h)Tj2. 

It is clear that F^ is C^-continuous and verifies F,,(0) = a+a{a)'i]i 
and r^(l) = b + a{h)ri2. Thus, by the definition of the distance, 
one has 

dL(a + r?i,b + ?72) 



conditions are given by a probability distribution p{^,£,')- Then 
for all fc > 0, 

2D 



rfM..(afc,bfc) < 



+ M 



dMo(ao,bo) 



1-M 
2D 



M 



dp(ao,bo), (II.2) 



9r\ 

9u 



M l^ 1 du 
au 



M 



-— + (o-(b)772 - cr(a)r7i) du 
au J 

= dL(a, b) + 2(a(b)772 - a(a)77i)^ j m(^\du 



-2(a(b)r,2) 



M.du ((T(a)77i) 



+ / (cr(a)j7i) M(cr(a)j7i)dM 
+ / (cr(b)ry2)^M(cr(b)ry2)dM. 



Remark that the second and third terms of the right-hand side 
vanish when taking the expectation. As for the fourth and fifth 
terms, remark that 



where [■]+ = inax(0, •). 
In particular, for all fc > 0, 

or) ..k 

E [lla,- -b,f] < j^j^^ + yE [di,,{^,0] ■ (113) 

Proof: Taking the conditional expectation given (ao,bo) = x 
and applying (H2d) and Proposition |2] one has 

Ex [dMfc+i(afc+i,bfc+i)J = Ex [diifc+i (f(a, fc) + o-(a,fc)«;fe, 

f(b,fc) + a(b,fcK)] 
< Ex[dii,^,(f(afe),f(b,,))]+2D, 

where w'f. has the same distribution as w^ but is independent of the 
latter. 

On the other hand, from (Hdl) and Proposition [T] one has 

Ex [di,^^^(f(aO,f(bfc))] < mEx [di,Jafe,bfc)] ■ 

If one now sets it/.. = Ex [djvifc (a^, b^)] then it follows from the 
above that 

lifc+i < /iiife + 2D. (II.4) 

Define next Vk = Uk — 2D/(1 — /i). Then replacing Uk by 
Vk. + 2D/(1 — fi) in JII.4t leads to v^+i < fJ^Vk- This implies that 
Vfc > 0, Vk < vofi'' < [vo]'^ fi'' ■ Replacing v^ by its expression in 
terms of Uk then yields 



(a(a)7)i) ' M(a(a)r?i) = tr (^(a(a)r,i) ' M(a(a)r7i)^ 

= tr (riJ a{a)^Ma{a)rii) = tr(77i' Qt^i) 



Vfc > Uk< 



2D 



2D 



UQ 



1-M 



where Q is obtained from cr(a)^M(7(a) by an orthogonal diago- 
nalization. One thus has 



E 



[tr(r77Qr,i)] = tr(Q) = tr(a(a)^Ma(a)) < D, 



which allows to conclude D 

We can now state and prove the discrete stochastic contraction 
theorem. 

Theorem 1: Consider the stochastic difference equation 



1-M 

'integrating the last inequality with respect to x leads to ill.2l . 
Finally, ni.3t follows from l lll.2l l by remarking that 

/2D 1 ^ 
^Mo (ao, bo) - Y^— dp(ao, bo) < 

/ ^Mo (ao, bo)dp(ao, bo) = E [dLo (C, C')] . (H-S) 



and that, ||afc - bfc||^ < -dM^(afc, bfe) D 



(II.6) 



(II. 1) 



afc+i = f (afc, fc) + a{a.h,k)wk+i 
ao = ?, 

where f is a R" x N -S- K" function, a is a R" x N -> R"'' 
matrix-valued function, {wk)k£N is a sequence of independent d- 
dimensional Gaussian noise vectors, with Wk ~ ^(0, 1) and ^ is 
a n-dimensional random variable independent of the Wk- 

Assume that the system verifies the following two hypotheses: 

(Hdl) for all fc > 0, the dynamics f (a, fc) is contracting in the 
metrics (Mfe,Mfe+i), with contraction rate /i (0 < /i < 
1), and the metrics Mfe(a) are uniformly positive definite 
in a and fc, with lower bound /3, i.e. 

Vfc > 0, a G R" a^Mfc(a)a > /3||af ; 

(Hd2) tr ((T(a, fc)^M(a, fe)CT(a, fc)) is uniformly upper- 
bounded by a constant D. 

Let (afc)fcgN and (bA;)^^^ be two trajectories whose initial 



Remark 2.1 [Relaxing the uniform hound on the noise]: 
Assume that the initial conditions are contained in a region U, 
then (Hd2) can in fact be replaced by [5] 

Vfc > 0, a G C/ E [tr {(j{slu, fc)^Mfc(afe)cr(afe, fc)] | ao = al < D. 

III. Continuous stochastic contraction 

Based on the discrete stochastic contraction theorem just es- 
tablished, we can now state and prove the continuous stochastic 
contraction theorem in general Riemannian metrics. 

Consider the Ito stochastic differential equation 

da = f (a, t)dt + a{&, t)dW 
a(0) = t 



(III.l) 



To ensure existence and uniqueness of solutions to equation 
JII.U . we assume the following standard conditions on f and a: 
Lipschitz condition: There exists a constant Ki > such that 



Vi > 0, a. b G 



lf(a,t)-f(b,t)|l + |la(a,f)-a(b,i)|l < 7^i|la-b|l 



Restriction on growth: There exists a constant K2 > such that 

Vt>0, aGR" ||f(a,t)||' + ||o-(a,f)f <iC2(l + ||af). 



Theorem 2: Assume that system JIII.ll l verifies the following two 
hypotheses: 

(Hcl) for all i > 0, the dynamics f (a, t) is contracting 
in the time- and state-dependent metric M(a, f) — 
0^(a, i)©(a, f), with contraction rate A (A > 0), i.e. 



A, 



Vi > 0, a G R" 
©(a,t) + ©(a,i) — j©(a,f)-^ 



<-A, 



where As = ^(A^ + A) denotes the symmetric part 
of a given matrix A. Furthermore, the metric M(a, i) is 
positive definite uniformly in a and t, with lower bound 

/?; 

(Hc2) tr (a(a, i)^M(a, i)(T(a, f)) is uniformly upper-bounded 
by a constant C. 

Let a(t) and b(f) be two trajectories whose initial conditions are 
independent of W and given by a probability distribution p{£,,C)- 
Then for all T > 0, 

C 

A 

C 



E[dL(T)(a(r),b(r))] < 



+e 



M{0) 



(ao,bo) - 



dp(ao,bo).(III.2) 



In particular, for all T > 0, 

E[||a(r)-b(r)f]<^ + 



-E d 



Proof: Fix (a(0),b(0)) = x G ™^'' 



M{0) 



(e,c') 



(III.3) 



and T > 0. We first 
discretize the time interval [0, T] into TV equal intervals of length 
5 = T/N and consider the two sequences {a.^)ken, (bj,)fcgN 
defined by 

a|+i =si+ 5f (aL fc<5) + a(af., kS)wi 
4 = a(0) 



bg = b(0) 



bi + 5f(b*,M)+a(bi,HK^ 



(111.4) 



where {wj,)k,zj,i and (luj. )k&i are two sequences of random vari- 
ables defined by «;f = W{(k + l)S)-W{k5)2indw'i = W'({k + 
1)5) — W'{k5). Note that, since W and W' are two independent 
Wiener processes, {wk)ken and {w'i^)ken are two sequences of 
independent Gaussian random variables with distribution N{0, 51). 
Note also that, by the strong convergence of the Euler-Maruyama 
scheme (cf. [6], p. 342), one has 



limEx[||alr-a(T)||2] = 0. 



(III.5) 



Hypothesis (Hc2) implies that system ( IIII.4I I satisfies (Hd2) with 
D = 5C. To verify (Hcl), denote by Gfe(a) the generalized 
Jacobian matrix of ( IIII.4t at step k. Denoting t — k5, one has 

G.(a)^Q(a,t + ^) ^^"+f("'^)) ©(a,t)-^ 

^&{t + 5)(l + 5^)&{t)-\ 
Remark that we have dropped the argument a for convenience. One 



can next rewrite Gj, Gk ~ Aq + 5Ai, with 



Ao 
Ai 



(©(t)-i)^ ©(f + 5)^&{t + 5)&(t)-^; 
5{@{t)~'y{@{t + 5)^&(t + 5)§^ 
+ {^y&it + 5)^&{t + 5))&{t)-\ 



Using the Taylor expansion @{t + 5) ^&{t) + 5&{t) + 0{5'^) 
leads to 

Ao = I + 25{&{t)&{t)-^^)s+0{5^); 
5A, = 5{@{t)-'y(&{t)^&{t)§^ + {^)^&{t)^@{t))@{t)-' 
+0(5^) 
= 5 (©(t)||©(i)-i + {@{t)'y [fy &it)^) + 0{5'') 



= 25(©(i)||©(t)-i)^ + 0(52). 
Summarizing the previous calculations, one has 

Gfc Gfe = I - 



25 (^(@{t) + @{t)^^ @{t)-'^ +0{5'), 



Thus, the hypothesis (Hcl) that f is contracting in the metric M 
with rate A implies 

A^,,(G^Gfe)<l-25A + e(5), 

with lim^^o -g = 0- Letting fi{5) = 1 — 25\ + e{5), one then 
has that /i < 1 for 5 sufficiently small, which in turn means that 
system | |III.4I ) satisfies (Hdl). Applying the discrete contraction 
theorem for k = N leads to 

25C 



Ex [dM„(air,b^)J < Y 



-mW 



+m('5)' 



di,„(a(0),b(0)) 



25C 



(III.6) 



1 - KS) 
On the other hand, one has, by the triangle inequality, 

Ex[dL(T)(a(r),b(T))] < Ex[dL„(a^,b^ 

+ Ex[dL„(a?„a(r)) 

+ Ex[dL„(b^,b(T))]. 

From equation ( Illl.St . the second and third terms of the right- 
hand side vanish when 5 — > 0. As for the first term, remark that 



2SC 



2SC 



C 



l-/j(<5) 26\-€{S) A+e{(5)/(5 

m(<5)^ = (1 - 25\ + e{5))^/' = ^i(-2sx+.(sy) 
One can thus conclude, by letting (5 — )■ 0, that 



c. 

A ' 



Ex[dL(T)(a(r),b(T))] <^+e 



A 



dL(o)(a(0),b(0))-y 



Integrating with respect to x then leads to the desired result ( IIII.2t . 
Finally, l lIII.3t follows from l lIII.2t by the same calculations as in 
l HOI l and QTeJ □ 

Remark 3.1 [Noisy and noise-free trajectories]: If (a, b) 
represent in fact a noisy and a noise-free trajectories then the bounds 
( IIII.2b and 011.31 ) are replaced by analogous bounds where C is 
replaced by C/2 (cf [9]). 

Remark 3.2 [Relaxing the uniform bound on the noise]: As in 
Remark 2.1, if the initial conditions are contained in a region U, 
then (Hc2) can in fact be replaced by 

Va G (7 Vit > 

E [tr ('cr(a(f),i)^M(a(f),i)cr(a(i),f)') | a(0) = al < C. 



Remark 3.3 ["Optimality" of the mean square bound]: If M 
is in fact state-independent, then tlie bound I IIII.2I ) is tlie same as 
that obtained in [9] (cf. Theorem 2 of that reference), which means 
that this bound is "optimal", in the sense that it can be attained 
(cf. section III-A of [9]). This contrasts with the bound obtained 
in [2] (cf. Lemma 2 of that reference), which has the same form 
as 1 IIII.2I ) but with different constants Ai and Ci, defined - using 
our notations ~ as follows: 



Ai = A 



e 



Ci = c + 



2e 



where a is a uniform upper-bound on the Frobenius norm of the 
matrix a (a, t), m is a uniform upper-bound on ||M(a, f)||, and e is 
a positive constant. Note that, for any choice of e, one has Ai < A 
and C\ > C, which yield a strictly looser bound compared to ( IIII.2t 
. Moreover, if e is small, Ai gets closer to A, but C\ becomes very 
large. On the other hand, if e is large, Ci gets closer to C, but Ai 
becomes very small. Thus, there is no value of e for which Ai and 
C\ are arbitrarily close to A and C respectively - and in practice, 
the difference between C\ and C can be extremely large because 
of the uniform upper-bounds a and fh. 

Example: Following [10], consider the following system 



il — X2\ 1+ x\ 



X2 = 



—x-i_x\ 



y = xi. 



Construct the observer 



Xl 



■ X2 



(£1 - y) 



X2 



-(£1 -y), 



Xl = Xl 



X2 = 



X2 



y/i + ^1 



(111.7) 



(III.8) 



(111.9) 



Note that this observe r differs from that of [10] : the denominator 

in JIII.9I ) is Y 1 + Xl instead of yl + j/^. The observer of [10] is 
interesting in that it is contracting in no state-independent metric (cf. 
Example 2.5 of that reference). It can be shown that this property 
is shared by the modified version ( IIII.8t -( |ni.9l ). 

Differentiating ( |III.9I > and replacing xi and £2 by their expres- 
sions in terms of xi, £2, y, one obtains 



Xl = Xl = X2 ~ {xi ~y) = X2J1 + xf — (xi — y) 
X2 a;2£ixi 



X2 = 



{xi - y) ixiX2 - vTTif j 



(III. 10) 



xix\ 



l + £? 



Observe that (a;i, 2:2) is a particular solution of ( Illl.IOt . To show 
the contraction behavior of ( Illl.lOt . consider the following nonlinear 
transform 



Xl 



-3£i +5i2\/l + *!, 
X2 — 3£i + 2x2 •( 
From dill. 9b . one has 

(xi,S2)^ 



'1+xf. 

P • (£i,£2) , 



(III. 11) 



where P is the 2x2 constant matrix 



Thus 



(ii,i2) =P-(xi,£2) =PQ-(£i,X2) = PQP ^■{xi,X2) , 
where the second inequality comes from ( Illl.IOt with Q — 



A numerical computation shows that the eigenvalues 
are (-0.24,-0.76), which 



1 1 

1 ^ 
of the symmetric part of PQP^^ 
means that system (s:i,a;2) is contracting with rate 0.24 in the 
identity metric. From ( |III.9t , one finally has that system (11,12) 
is contracting with rate 0.24 in the metric 


IM = P ' P©, where 



^A+5 



VT+^ 



Let us now study the convergence properties of the observer when 
the measure yp is corrupted by white noise as j/p = j/ + S^, where 
y = Xl is the unperturbed measure, ^ is a "white noise" of variance 
1 and S is the noise intensity. Using the formal rule dW = ^dt, 
equations l llll.St are transfoimed into 

dii = (x2- (£1 -y))dt + 5'dVK 

d£2 = -(£i-y)dt + S'dVK. (11112) 

The observer equations ( Illl.IOt become 



dxi 



dX2 = 



X2 



- S 



1 + xf - (Xi ^ 
(xi — y) f X1X2 



y) 



At + SAW 



y/TT: 



l + xi 



+ 



Xixi 



Vi + *f 



At 



X1X2 ■ 



ymf 



AW. 



One is now in the settings of Theorem |2] with 

ct(xi,X2) 



S,S 



X1X2 



\/rn 



l + xf 



From the above expression, it can be shown algebraically that 
sup„i,a(o,6)^]V[(a,6)cr(a,b) = 15.25^ 

We now make the assumption that ||x2|| is uniformly upper- 
bounded by a constant B (which can indeed be shown using an 
independent method, see also simulations in Fig. [T}. Then, it can 
be shown that, uniformly, 

||0'^P'^P0xf > 7(B)||xf . 



One thus can apply Theorem |2] and obtain the bound d III. 31 ) with 
A = 0.24, C = 15.25^ and /3 = 7(B). Note that, for t -^ cx), 
one has X2 — >■ 0, such that one has the bound B = 0, which in 
turn corresponds to 7(5) — 12.95. The bound after exponential 
transients is then given by (cf . Fig. [T] for numerical simulations) 

C 



2/3A 



= 2.455'^ 



(HI. 1 3) 



IV. Conclusion 

We have established the stochastic contraction theorems in the 
case of general time- and state-dependent Riemannian metrics. 
In the limit when the metric becomes linear (state-independent), 
the bounds we derived are the same as those obtained in [9], 
which means that they are "optimal", in the sense that they can 
be attained. This development allows extending the applicability 
of contraction analysis to a significantly wider range of nonlinear 
stochastic dynamics, such as stochastic observers or networks of 
noisy nonlinear oscillators. 
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Fig. 1. Simulations for the observer studied in tlie text. A: evolution of 
the systems for t G [Os, 5 s]. Equations jIII.7t were integrated using the 
Euler method with time step At = 0.01s (red line: x^; blue line: X2)- 
Equations till. 12t were integrated using the Euler-Maruyama scheme (cf. 
[6]) with the same time step At = 0.01 s. We plotted 20 sample trajectoiies 
for noise intensity 5 = 1 starting from the same deterministic initial values 
(xi(0), X2{0)) (magenta lines: x\; cyan lines: X2)- B: evolution of the 
systems for t G [5 s, 15 s]. Note that, for clarity, the values of X2 and £2 
were multiplied by 400 in this plot. To assess the theoretical bounds, we 
plotted the sample mean square error {xi—xi)'^ + {x2—X2)'^ (plain green 
line) and the theoretical bound after transients given by equation )III.13t 
(dashed green line). For clarity, these values were multiplied by 10. 
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